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Abstract 

We describe the way in which the sign of the Schwarzian derivative for a family of diffeomor- 
phisms of the interval / affects the dynamics of an associated many-to-one skew product map 
of the cylinder (R/Z) x I, 
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1 Introduction. 

Ittai Kan has described a simple example of a skew product map from the cylinder (R/Z) x / to 
itself such that the two boundary circles are measure theoretic attractors whose attracting basins 
are intermingled ', in the sense that the intersection of any nonempty open set with either basin has 
strictly positive measure. (See |Kan, 1994| .) This note will consist of three variations on the maps 
which he introduced. 

Sections |31 and 4 will describe Kan's example in slightly more generality, emphasizing the impor- 
tance of negative Schwarzian derivative. Section [5] will show that if we substitute positive Schwarzian 
derivative then the behavior will change drastically, and almost all orbits will have a common 
asymptotic distribution. In the case of zero Schwarzian derivative, ^Jwill prove in some cases (and 
conjecture in others) that almost all orbits spend most of the time extremely close to one of the 
two cylinder boundaries; but that each such orbit passes from the e-neighborhood of one boundary 
circle to the e-neighborhood of the other infinitely many times on such an irregular schedule that 
there is no asymptotic measure. 

Most technical details are relegated to the two appendices. 



2 Preliminaries. 

Let I = [0,1], and let C be the cylinder (R/Z) x I with boundaries Aq = (R/Z) x {0} and 
.4.1 = (R/Z) x {1}. Let F : C be a C 3 -differentiable map of the form 

F(x,y) = (kx, f x (y)) , (1) 
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(3x, f (y)) 



(x,y) 



(0, 1) fixed 



(0, 0) fixed 



Fig. 1: The cylinder map F in the case k = 3. 



where k > 2 is a fixed integer, and where each f x : I — ► I is a diffeomorphism with f x (0) = and 
f x (l) = 1. Thus the derivative 

f'M = dfx(y)/dy 

must be strictly positive everywhere. 

We next introduce two key concepts that will be needed. 

LEMMA A.l. For i equal to zero or one, let B t be the attracting basin of the circle 
A L . If the transverse Lyapunov exponent 



Lyap(A 



log {f x (i))dx 



(2) 



is negative, then the basin B L has strictly positive measure. In fact, for almost every 
x £ IR/Z the basin B L intersects the "fiber" x x I in an interval of positive length. On 
the other hand, if Lyap(^4 t ) > then B L has measure zero. 

The proof will be given in Appendix A. 

In fact, whenever Lyap(v4 t ) < it is not hard to see that the circle A L is a measure attractor. 
By this we mean that it satisfies the following two conditions: 

1. A L is a minimal measure attracting set, that is, it has an attracting basin of positive measure, 
but no closed proper subset has a basin of positive measure. 

2. Furthermore, A L contains a dense orbit, and hence cannot be expressed as the union of 
strictly smaller closed invariant sets. 1 

Recall that the Schwarzian derivative of an interval C 3 -diffeomorphism / is defined by the formula 



Sf{y) 



f"'{y) 
f'(y) 



f"(y) 

m 



(3) 



1 The following example shows that Condition 2 does not follow from Condition 1. Consider a flow in the plane 
such that all orbits near infinity spiral in towards a figure eight-curve, while all orbits inside either lobe of the figure- 
eight spiral in towards an attracting equilibrium point. Then the figure-eight is a minimal measure attracting set 
with no dense orbit. 
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We will make a particular study of maps F(x, y) = (kx, f x {y)) such that the Schwarzian Sf x {y) 
has constant sign for almost all (x,y) G C. Maps f x with S(f x ) < almost everywhere have 
the basic property of increasing the cross-ratio p(yo, Vi, 2/2> IJ3) for all yo < yi < V2 < y3 in the 
interval. (See Appendix B.) Similarly, if S(f x ) > (or if S(f x ) = 0), then f x will decrease (or 
will preserve) all such cross-ratios. 

3 Negative Schwarzian. 

LEMMA 3.1. If Sf x {y) has constant sign {positive, negative or, zero) for almost all 
(x, y), then Lyap(_4o) + Lyap(.Ai) has the same sign. In particular, if Sf x (y) < for 
almost all (x, y), then 

Lyap(A) + Lyap(.Ai) < , (4) 
hence at least one of the two boundaries has a basin of positive measure. 

Proof. Lemma B£2](in Appendix B) will show that f x (0)f' x (l) < 1 whenever f x has negative 
Schwarzian. Integrating the logarithm of this inequality over M/Z, the inequality (j3J) follows. Thus 
the transverse Lyapunov exponent is negative for at least one of the two boundaries. Hence the 
associated basin has positive measure by Lemma ATT1 □ 

THEOREM 3.2. // Sf x {y) < almost everywhere, and if both basins have positive 
measure, 2 then Bo U B\ has full measure. In fact, there is an almost everywhere defined 
measurable function a : M/Z — > I such that 

(x,y) £ Bo whenever y < a(x) , 

(x,y) £ B\ whenever y > a(x) . 

More generally, the same statement is true if the k-tupling map on the circle is replaced by 
any continuous ergodic transformation g on a compact space with g-invariant probability 
measure. 

In fact we will usually consider maps f x (y) for which the behavior of F near the two boundaries 
is similar enough so that Lyap(.Ao) and Lyap(^4i) are equal to each other (or at least have the 
same sign). For such maps, the condition Sf x < will guarantee that both attracting basins have 
positive measure. 

Proof of Theorem 13.21 Since each f x is an orientation preserving homeomorphism, there are 
unique numbers < o"o(x) < cr\{x) < 1 defined by the property that the orbit of (x,y): 

converges to ^4o if V < 00 0*0 
converges to A\ if y > (J\(x) 
does not converge to either circle if o~o(x) < y < a\(x) . 



2 We don't know whether this hypothesis is necessary. 
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Thus, the area of Bo can be defined as J ao(x)dx. Since this is assumed to be positive, it follows 
that the set of all i£l/Z with o~o(x) > must have positive measure. In fact, the evident identity 
ao(kx) = /z(cxo(x)) implies that this set is fully invariant under the ergodic map x kx. Hence 
it must actually have full measure. Similarly, the set of x with a\{x) < 1 must have full measure. 

We will make use of the property that a map f x of negative Schwarzian derivative increases the 
cross-ratio 

en i\ 2/2(1-2/1) 
P{0, VI, 2/2, 1) = — jz r , 

yi (i - 2/2) 

that is: 

p(0, fxivi), fxfa), 1) > p(0, yi, y 2 , 1) > 1 for all < y x < y 2 < 1 . 

(See Lemma ETH) Suppose that the inequalities < ao(x) < <Ji(x) < 1 were true for a set of 
x £ R/Z of positive Lebesgue measure, then the function 

r{x) = p(0, a (x), cri(x), l) > 1 

would satisfy r(kx) > r{x) on a set of positive measure, with r(kx) > r(x) everywhere. It would 
follow that 

dx f dx 

< 



r(kx) J r(x) 

R/z R/z 

But this is impossible: Lebesgue measure is invariant under push-forward by the map x 1— > kx, and 
it follows that J <p{kx) dx = J 4>{x) dx for any bounded measurable function cf). This contradiction 
proves that we must have (Tq(x) = a\(x) almost everywhere; and we define a(x) as this common 
value. □ 



Remark 3.3. We can then define the separating measure [3 on C to be the push-forward, under 
the section, x 1— * (x, cr(x)), of the Lebesgue measure X x , on M/Z. Evidently (3 is an F-invariant 
ergodic probability measure which in some sense describes the "boundary" between the two basins. 
Since < cr(x) < 1 almost everywhere, it follows easily that both boundaries have measure 

/3(A) = 0. 



4 Intermingled Basins. 

Now assume the following. 

Hypothesis 4.1. There exist angles x~ and x + in R/Z, both fixed under multiplication 
by k, such that f x (y) < y for all < y < 1 and all x in a neighborhood of x~ , and such 
that f x (y) > y for all < y < 1 and all x near x + . 

It follows that the entire vertical line segment {x~} x [0, 1) is contained in the basin Bo, and 
that the entire segment {x + } x (0, 1] is contained in the basin B\. 



THEOREM 4.2. (Intermingled Basins). // Hypothesis \4-f\ is satisfied, and if both 
basins have positive measure, then the two basins are intermingled . That is, for every non- 
empty open set U C C, both intersections BoHU and B1P1U have strictly positive measure. 
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Proof. Define measures po and p\ on the cylinder by setting p L (S) equal to the Lebesgue 
measure of the intersection B L H S for l equal to zero or one and for any measurable set S. Clearly 
the support supp(^J, that is the smallest closed set which has full measure under p t , is fully 
F-invariant. We must prove that this support is equal to the entire cylinder. 

To begin, choose any point (xo, yo) £ su PP(/ x o) with < yo < 1. Construct a backward orbit 

••• >-> (x-2,y-2) >-> (x-i,y-i) i ^ (x ,yo) 

under F by induction, letting each x_a. +: n be that preimage of which is closest to x~ . Then it 
is not difficult to see that this backwards sequence converges to the point (x~ , 1). Since supp(//o) is 
closed and .F-invariant, it follows that {x~ , 1) G supp(/io)- But the iterated pre-images of (x~ , 1) 
are everywhere dense in the upper boundary circle A\, so A\ is contained in supp(^o)- Since the 
basin Bo is a union of vertical line segments x x [0, o"o(x)) or x x [0, o"o(x)] , it follows easily that 
supp(/^o) is the entire cylinder. 

The proof for fii is completely analogous. □ 

Remark 4.3. In place of a fixed point on the circle, we could equally well use a periodic point 
k p x = x (modZ). It is only necessary to check that the iterated map F op satisfies the required 
hypothesis. 




Fig. 2: Intermingled basins for the cylinder map F of Example \4.4\ 

Example 4.4. Following |Kan, 1991] , let 

Qa(y) = y + ay(l - y) . (5) 

If \a\ < 1, then q a maps the unit interval diffeomorphically onto itself, with q a (0) = and 
g a (l) = 1. It is easy to check that Sq a {y) < whenever a / 0. It then follows from Lemma I PI 
that q a (y) has the property of increasing cross-ratios. 

Choose < e < 1, and let p(x) = e cos(27rx). Then for any k > 3 the map 

F(x,y) = (kx, f x (y)) where f x (y) = q p ( x) {y) 

will satisfy Hypothesis 14.11 and also the hypotheses of Theorem 13.21 In fact, we can take x + = 0, 
and choose x~ to be a fixed point which lies between 1/3 and 2/3. For example, take 




1/2, for k odd, 

k/(2k-2), for k > 4 even. 
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Thus we obtain explicit examples of maps with intermingled basins. (Compare Fig. |2J) 

(In fact this argument will work for k = 2 also, using the periodic orbit 1/3 <-> 2/3 in place of 
a fixed point.) 

Remark 4.5. Very similar examples of intermingled basins can be observed in rational maps 
of the projective plane. (Compare |Bonifant, Dabua and Milnor, 20061 §6]-) It would be very 
interesting to know to what extent the examples in the following two sections also have analogs 
among such rational maps. 

5 Positive Schwarzian 

In this section we continue to study the cylinder maps F(x,y) = (kx, f x (y)) , but now assume 
that Sf x > almost everywhere. 

THEOREM 5.1. If Sf x {y) > for almost all (x,y), then at least one of the transverse 
lyapunov exponents Lyap(.Ao) and Lyap(«4i) is strictly positive. If both are strictly positive, 
then F has an asymptotic measure? That is, there is a uniquely defined probability measure 
v on the cylinder such that, for Lebesgue almost every orbit (xo,yo) l— ► (xi,yi) 
and for every continuous test function \ '■ C — > R, the time averages 

^ 71—1 

Vij) 

i=0 

converge to the space average J c x(x,y) dv(x,y) as n — > oo. {Briefly, almost every 
orbit is uniformly distributed with respect to v.) Furthermore, both boundaries of C have 
asymptotic measure v(Ao) = v(A.i) equal to zero. 

Thus, under these hypotheses, almost all orbits of F have the same asymptotic distribution. 

Outline of the Proof. Since the proof of this theorem will be slightly circuitous, we first 
outline the main steps. 

• First the circle R/Z of the previous section will be replaced by the solenoid 

S = S fc = lim(R/k n Z) , 

and the many-to-one map F of (R/Z) x / will be replaced by the associated invertible map 
F from E X J to itself. 

• Since F -1 has negative Schwarzian on each fiber, Theorem 13.21 asserts that the union of the 
attracting basins of the two boundaries S x {0} and S x {1} under the map F^ 1 will 
have full measure. In fact there is an almost everywhere defined section 

x i ^ ( x, a(x) ) (6) 

from S to S x I which "separates" the two attracting basins. 
3 Terms such as: natural measure or physical measure are also used in the literature to denote this type of measure. 
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• There is a standard ergodic invariant probability measure /is on the solenoid. Pushing it up 
to the graph of a under the section we obtain an ergodic invariant probability measure 
v on £ x /. 

• Since almost all points are pushed away from the graph of a by the inverse map F , it 
follows that they are pushed towards this graph by the map F. In this way, we see that V 
is an asymptotic measure for F . 

• Finally, we denote by v the push-forward of v under the projection 

S x / -> (R/Z) x /. 
This will be the required asymptotic measure for the original cylinder map F. 




Fig. 3: 50000 points of a randomly chosen orbit for the cylinder map F of Example \5.2\ 

Example 5.2. One example of a family of interval diffeomorphisms with positive Schwarzian 
is given by the inverses 

fx(y) = q~( x) {y), 

where q p ( x )(y) is the quadratic map © of Example 14.41 Here Sf x (y) > whenever p(x) ^ 0. 
(Compare Proposition ETTlin the appendix.) For this special example, the asymptotic measure v 
turns out to be precisely equal to the standard Lebesgue measure A on the cylinder. In other 
words: 

Randomly chosen orbits are uniformly distributed with respect to Lebesgue measure. 

(Compare Fig. |3J). To prove this statement, one only needs to show that Lebesgue measure is F- 
invariant. In fact there are k branches of F^ 1 on any small open set U C C, each given by 

F' 1 ^^) = (x/k, y + ecos(27tx/k)y(l-y)) (7) 
for one of the k choices of x/k (mod Z). The Jacobian of this branch (JJJ) is equal to 

(l + e cos(27rx/£;)(l - 2y)) /k . 

Since the sum of cos(27rx//c) = 3f?e 27nx//fc over the k choices for x/k is zero, the sum of Jacobians 
is +1, which means that F preserves the Lebesgue measure A. Now Theorem 15.11 asserts that 
an asymptotic measure exists. Such a measure v must necessarily be equal to the weak limit of 
(A + F^X + • • • + i ? * n ~ 1 A) /n as n — > oo; and it follows that v is precisely equal to A. 
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Proof of Theorem 15.11 The argument begins as follows. Denote by E the solenoid of 
backwards orbits 

X = {•■■>-> X-2 l-> X-l I ^ X } = {X- n } 

under the map x \— * kx. Thus E maps homeomorphically onto itself under multiplication by k, 
with the right shift map 

x-2 ► a?— i > xo} i— ► {• • • i— > x_3 i — * x-_2 ► £-1} 

as inverse. There is a standard probability measure fis on E, defined by the requirement that 
each projection x i— > x- n is measure preserving. 

LEMMA 5.3. \\x\\/k < \\kx\\ < k\\x\\ for all SeS. 

Proof. This follows easily from the definition. □ 

Given maps f x (y) as in £jH we again consider the associated map F : C — > C. If Sf x (y) > 
almost everywhere, then by the analogue to Lemma lM.l[ 

Lyap(A)+Lyap(A) > 0. 

There is also a natural extension F : S x I — * E x / of the map F. This is a homeomorphism 
defined by the formula 

F(x, y) = {kx, UM)- (8) 



LEMMA 5.4. // Sf x {y) > for almost all (x,y), and if both Lyap(^4o) and Lyap(y4.i) 
are strictly positive, then there exists a measurable function a : S — ► /, defined almost 
everywhere, and satisfying the identity 

o-(kx) = f X0 (<r(x)) 

for almost all ieE. It follows that the graph of a, that is the set of all pairs 

(x, aix)) G E x I, 

is invariant under the extended map F : Ex/-»Ex/, so that F(x, o~(x)) = (kx, o~(kx)) 
for almost all x. 

Caution: In cases of interest, this function a will not be continuous and will not be everywhere 
defined. 

Proof of Lemma 15.41 We apply Theorem 13.21 to the inverse map F~ l , with the A:-tupling 
map on the circle replaced by the right shift map on the solenoid. This yields a measurable section 
x i — > (x, <t(5?)) from E to Ex/. The required F-invariance property then follows easily. □ 

Next we will show that almost every orbit under F converges, in a suitable sense, to the graph 
of a. Recall that o~(x) is well defined and belongs to the open interval (0, 1) for almost every 
x £ E. Thus, for almost every point (x, y) £ E x (0, 1), the quantity 

r(o-(x), y) = | logp(0, a(x), y, l)| > 

is defined and finite, vanishing if and only if y = o~(x). We will think of r(a(x), y) as a measure 
of distance between o~(x) and y. 
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LEMMA 5.5. Under the same hypothesis, for almost every orbit (xq, yo) > (xi yi) >— > 
■ ■ ■ under F this measure of distance r(o~(x n ), y n ) converges to zero as n —* oo. 

Proof. Since the map Fix, y) = (kx, f xo {y)) decreases cross-ratios on each fiber (compare 
Lemma rO|). we have 

r(a(kx), f xo (y)) < r(a(x), y) 
almost everywhere. For any constant ro > 0, let N(tq) be the strip consisting of all 

(x, y) G £ x (0, 1) with r(a(x), y) < ro . 

Evidently N(ro) is mapped into itself by F. Given constants < ro < ri, we will also consider 
the difference set N(ri)\N(ro). Let s(x) < 1 be the supremum of the ratio 

r(a(kx), f X0 {y)) 

for (x, y) G A(n)\iV(ro) , (9) 

or in other words for ro < r(a(x), y) < r\. Since the Schwarzian is positive almost everywhere, it 
is not hard to see that this supremum satisfies six) < 1 for almost all x. (Here we make essential 
use of the fact that ro > 0, since if the Schwarzian vanishes at (x, a(x)) then the ratio Q would 
tend to 1 as y tends to <r(af).) 

Therefore the average of logs(x) over the solenoid is strictly negative. A straightforward 
application of the Birkhoff Ergodic Theorem then shows that, for almost every Xq \— * x\ i— > ■ ■ ■ , 
some partial product of the sixj) satisfies 

s{x ) ■ ■ ■ s(x n -i) < r /n. 

This means that the iterate F on maps N(r\) into N(ro). Since < ro < r% can be arbitrary, 
this completes the proof of Lemma 15.51 □ 

Now define the probability measure v on Ex/ to be the push-forward of the standard 
measure /is on the solenoid under this section a : x i— ► (x, cr(x)). 

LEMMA 5.6. This v is an asymptotic measure for the extended map F : E xJ — * E x I. 

Proof. We know that almost every orbit (xq, yo) l— * (^lj 2/1 ) l— > ■ • ■ under F converges (in 
the sense of Lemma l5.5|) towards the graph of a. If x : E x / ^ ]R is any continuous test function, 
then it follows easily that the difference between the time averages 

n— 1 n— 1 ra— 1 

(^X{xi,yi)}/n and ( ^ x@h °"(^))) /" = (^^(^(^i)))/" 
o oo 

converges to zero as n — > oo. But the Birkhoff Ergodic Theorem, applied to the bounded mea- 
surable function p?:S-»l, asserts that this last time average converges towards the space 
average 

\ o <///-(.)') - J ;\(.r. (j)du(x. //) . 

ExJ" 
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as required. □ 

Proof of Theorem 15.11 (conclusion). If Sf x > it follows from Lemma EP1 and Lemma 
AfTI that at least one of the transverse Lyapunov exponents of the boundary circles is strictly 
positive, hence its corresponding basin has measure zero. If both, Lyap(^4o) an d Lyap(«4i) are 
strictly positive, the same lemma implies that the set of orbits converging to Aq U A\ has measure 
zero, so that the Lemmas 15.41 through 15.61 apply. 

Pushing forward the canonical measure //s on £ by the section a : x *— > (x, o~(x)), we obtain an 
asymptotic measure v = 3*(/4s) for the map F. Now, pushing forward again under the projection 

(x, y) i ^ (x Q , y) 

from £ x I to (R/Z) x I = C, we obtain an .F-invariant measure v on C. Since almost every orbit 
under F is uniformly distributed with respect to it follows that almost every orbit under F is 
uniformly distributed with respect to v. □ 

Remark 5.7. In the spirit of Remark 13.31 one could say, that the separating measure for F^ 1 is 
an asymptotic measure for F. 



6 Zero Schwarzian 

This section will study the intermediate case where each diffeomorphism f x :I—*I has Schwarzian 
Sf x identically zero. Such a map is necessarily fractional linear, and can be written for example 
as 

V -> : , ^ 7s with a > 0, (10) 
1 + (a - l)y 

where a is the derivative at y = 0. It will be convenient to replace y by the Poincare arclength 
coordinate 

t(y) = logp(0, 1/2, y, 1) = log ^- , (11) 

which varies over the entire real line for < y < 1, with inverse y = e*/(l + e*). If we embed the 
unit interval in the complex open disk of radius 1/2 centered at 1/2, then \t\ can be described as 
the distance from the midpoint, using the Poincare metric for this disk. (Compare Appendix B.) 

Since we are assuming Sf x identically zero, it follows that each f x preserves cross-ratios 
or Poincare distances. (See Equations 1)1 ljl and (jB4|) of Appendix B.) Therefore, in terms of the 
Poincare arclength coordinate t, the map f x will simply be a translation, t i— > t + c where c 
is a constant depending on x. In other words, 

t(fx(y)) = t(y) + c ) where c = log(a) G R or a = e c . 

Using this displacement c in place of the original parameter a , the 1-parameter group of fractional 
linear transformations of the unit interval takes the form 

c c y 

9c(y) = r. / " tt - i 
1 + (e c - \)y 
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where g c + c ' = 9c ° 9c' ■ Given any smooth function p from R/Z to R, we can set c = p(x) to 
obtain an associated cylinder map 

F(x, y) = (kx, g p ( x )(y)) ■ 

(The map F has an absolutely continuous invariant measure dxdt. However, this is not very 
useful since the total area J J dxdt is infinite.) 

Using the coordinate t G R in place of y £ (0,1), the cylinder map F will correspond to 
the map 

(x, t) i ^ (kx, t + p(x)) 
of (R/Z) x R. The dynamics of F under iteration is governed by the average 

A = j p(x)dx (12) 

of the displacement p(x). For almost any orbit (xq, to) l— *• ■ ■ ■ l— ► (^n 5 ^n) l— * ■ • • > it follows from 
the Birkhoff Ergodic Theorem that the time average 

(t n -t )/n = (p(xo)H |-p(ar n _i))/n 

converges to the space average ^4 as n — > oo. Thus if A > then it follows that t n will converge 
to + oo. In other words, the corresponding orbit for F will converge towards the upper cylinder 
boundary A\, so that A\ will be a global attractor under F. Similarly, if A < then the lower 
boundary .Ao will be a global attractor. 

The borderline case where the average (|12j) is exactly zero, is much more interesting. We con- 
jecture that the long term behavior of the sequence of numbers to, t\, t%, ... is very much like that 
for a random walk, in which the successive differences At n = i n +i — t n are identically distributed 
independent random numbers with mean zero. In particular, we believe that the following theorem 
will be true whenever the periodic function p(x) has average zero. However, the proof will apply 
only in the following very special case. 

Hypothesis. We now assume that p(x) is a step function which takes a constant 
value on each of the k intervals j/k < x < (j + l)/k. 

If xq is randomly chosen, it then follows easily that the successive steps 

At n = t n+ i - t n = p(x n ) 

actually are identically distributed independent random variables, which do not depend on the 
value of t n . (In fact At n depends only on the n-th entry in the base k expansion of xq.) Choose 
some number N > and define three sequences as follows. Let 

a n be the number of integers 1 < i < n with ti > N, 

b n the number of such integers with < N, and 

c n the number of such integers with ti < —N. 

Thus the associated frequencies a n /n, b n /n, c n /n will have sum equal to + 1. 
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THEOREM 6.1. Let p(x) be a step function as described above, not identically zero but 
with average f p(x) dx equal to zero. Then for arbitrary to and for Lebesgue almost every 
xq, the ratios b n /n associated with the orbit (xq, to) \— * (x±, t{) \— * ■ ■ ■ will converge to 
zero as n — > oo ; but a n /n and c n /n will not tend to any limit. In fact, these ratios 
vary so wildly that 

lim inf (a n /n) = < limsup (a n /n) = 1, 
with a similar statement for c n /n. 



In terms of the original cylinder map, this means that most orbits spend most of the time 
extremely close to one or the other of the two boundaries; very occasionally jumping from one 
boundary to the other but in such an irregular way that there is no limiting asymptotic measure. 

We are indebted to Harry Kesten and especially to Mikhail Lyubich for very substantial help 
with the proof. To begin the argument, note that for each to an d each x$ E R/Z the number 
L = limsup n ^ oc (a n /n) is well defined, and is invariant under finite permutations of the sequence 
of differences At n , or equivalently under finite permutations of the entries in the base k expansion 
of xo. Therefore, according to the Hewitt-Savage Zero-One Law, each set 

{xq ; L < constant} 



has measure either zero or one. (See Feller 2, 1966[ IV. 6].) It follows easily that L takes some 
constant value for almost all x$. 

First consider the case iV = 0. We will prove that L = 1 except on a set of measure zero. 
Choose some small e>0 and let X n (e) be the set of all xq for which a n /n>l — e. Evidently 
the intersection 

p| (X n (e) U X n+ i(e) U X n+2 (e) U • • •) 

n 

is precisely the set of xq for which L = limsup(a n /n) satisfies L > 1 — e. Thus, if we assume 
that L < 1 — e on a set of xo of positive (and hence full) measure, then it would certainly follow 
that the measure of X n {e) must tend to zero as n — > oo. But according to the Arcsine Law, the 
measure of X n {e) converges to the limit 

2 . , 2 . _ 

1 arcsin y 1 — e = — arcsin ye > 

7T 7T 

as n — > oo. (See |Feller 1, 1968| III. 4; and 2, 1966, XII. 8].) This contradiction proves that 
limsup(a n /n) = 1 almost everywhere. Since o n + c n < n, it follows that liminf (c n /n) = 0; and 
a similar argument with a n and c n interchanged completes the proof for the case N = 0. 

To complete the proof of Theorem 16.11 choosing some arbitrarily large N, we must show that 
the associated sequence b n /n converges to zero for almost all xq. The proof will be based on the 
following. 

Consider a random walk To, t%, ... on the circle M/Z, starting with some specified To, where 
the differences Ar n = r n +i — r n are identically distributed independent random variables which 
do not depend on r n . 
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LEMMA 6.2. The resulting sequence {r n } is uniformly distributed around the circle 
with probability +1 if and only if the following condition is satisfied: 

(*) There is no finite cyclic subgroup Gc K/Z such that Ar n £ G with probability +1. 



Proof. (Compare |Levy, 1939] , |Kawada and Ito, 1940| .) Clearly this subgroup condition 



is necessary. To prove that it is sufficient, let \i be the common probability distribution for the 
differences Ar n , let S C R/Z be the support of fi, and let 5 N = S x S x S x • • • be the 
space of sequences s = (so, s±, . . .) of elements of S, provided with the shift invariant measure 
/i N = fi x fj, x • • • . It is not hard to see that the skew product map 

F(t, s , si, . . .) = (r + s , si, s 2 , ...) 

from (R/Z) x ,S N to itself preserves the measure A x where A is Lebesgue measure. We will 
prove that F is ergodic; or equivalently that: 

Every bounded measurable F -invariant function <f> : (R/Z) x S —> R is constant 
almost everywhere. 

Given such a function (f> , let 

</>n(r, s , . . • , s n -i) = / <f)(r, s) dfi N (s n , s n+ i, . . .) 



be the average of 0(r, s) over all possible choices of s n , s n+ i, .... Clearly <p n can also be 
expressed as cin average over all choices of s n , 

<Pn(r, s , . . . , s„_i) = <p n+1 (r, s , . . . , s n ) dfi(s n ) . 



On the other hand, using the condition <j) = <j)oF of F-invariance, we see that (j) n (r, so, • • • , s n _ i) 
can also be described as the average of <^(r + so, si, S2, . . .) over all choices of s n -\-ij . . . ) . But 
by definition, this is equal to n _i(r + so, si, . . . , s ra _i). Thus, inductively, it follows that 

<p n (T, S , ... , S n _i) = (f> n -l(T + Sq, Si, . . . , S n _i) = <f) n -z(T + S + Si, S 2 , . . . , S n _i) = • • • 

= (j) (T + S -\ hs n _i). (13) 

In particular, note that 

<t>o (t) = J Mr, s)dfi(s) = J 4>o(r + s)dn{s). (14) 

If the condition (*) is satisfied, then we will use this last equation to prove that 0o is constant 
almost everywhere. Let 

= / e(-qs)d/i(s) 



be the Fourier transform of //, where g G Z, and where e(t) is an abbreviation for e 2mt . Thus 
/2(g) is a weighted average of points on the unit circle; hence \fi(q)\ < 1. For q ^ 0, the condition 
(*) guarantees that the weight is not all concentrated at points s such that qs = (mod Z), so 
it follows that fl(q) ^ 1. 



14 



Schwarzian Derivatives and Cylinder Maps 



The Fourier transform (fto(q) is denned similarly as the integral of e(— gr)^o( T ) dr. Recall 
that a bounded measurable function on the circle is constant almost everywhere if and only if its 
g-th Fourier coefficient is zero for every q ^ 0. Multiplying equation (|14[) by e (— qr) dr and then 
integrating, using the substitution r] = r + s, we obtain 

4>o(q) = J e(-qri) (f>o{v) d>V J e(qs)du(s) = (f> (q) ju(-g) . 

For every q ^ 0, since fi(—q) ^ 1, it follows that (fto(q) = 0. Therefore (fto takes some constant 
value v almost everywhere. Using equation (|13|) . it follows that every (ft n also takes the value v 
almost everywhere. 

We can now prove that (ft is constant almost everywhere. Suppose to the contrary, for example, 
that 0(t, s) > v + e > on a set £ of positive measure. Choose a point of density (r*, s*) 
for £ . Since (ft is bounded, it would follow that the average of (ft over a small neighborhood 
of (t*, s*) is strictly greater than v. But the fact that each (ft n equals v almost everywhere 
implies that every such average is also equal to v. Thus (ft must be constant almost everywhere; 
which proves ergodicity. 

Now the Birkhoff Ergodic Theorem implies that almost every orbit of F is uniformly distributed 
with respect to the measure A x /i N . Projecting to the first coordinate, it follows that almost every 
sequence 

r, r + s , r + so + si, ... 
is uniformly distributed with respect to A. This completes the proof of Lemma lH. 21 □ 

Proof of Theorem 16.11 (conclusion). Recall that b n is the number of < j < n with 
\tj\ < N . We must show that the ratio b n /n tends to zero with probability +1 as n — > oo . 
Choosing some large number L ^> N, the quotients t n /L (mod Z) form a random walk on 
the circle. If L is chosen so that the ratio At n /L is irrational with probability > 0, then by 
Lemma 16.21 these quotients are uniformly distributed around the circle. Let b n (L) > b n be the 
number of j £ [0, n) with tj congruent to an element of [-N, N] modulo UL. Then it follows 
that b n (L)/n converges to 2N/Las n — > oo . Since L can be arbitrarily large, this proves that 
b n /n — ► 0, as required. □ 



Appendix A: The Transverse Exponent. 

Let (x, l) be any point of the boundary circle A L , where i can be either or 1. By definition the 
transverse Lyapunov exponent along the circle A L at (x, l) is defined by 



1 



L y a PA( x ) = , lim 7 lo g -5 — ( x >y) 



dF 



ok 



dy 



evaluated at y = i , 



whenever this limit exists. (In this case, transverse really means normal.) Here F(x,y) = (kx, f x {y)) 
as usual. By the chain rule, the above expression can be written as 

Lyap A (x) = Hm - log (f' Xo (i)f Xl (i) • • • f' Xk _M) 



where xq i— ► x\ i— > 



is the orbit of x = xq. 
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Let us denote by A the 2-dimensional Lebesgue measure on the cylinder C = (R/Z) x I, and 
by X x the 1-dimensional Lebesgue measure along R/Z. Since \ x is ergodic and invariant under 
multiplication by k (see Equation (|T|))- it follows from the Birkhoff Ergodic Theorem that this 
transverse Lyapunov exponent is defined and independent of x for almost all x, and is equal to the 
integral 

LyaPA = J lo g (f'x(Q))dx, 

R/Z 

for almost all x. 

Let us prove now the Lemma stated in ^21 

LEMMA A.l. For i equal to zero or one, let B L be the attracting basin of the circle A L . 
If the transverse Lyapunov exponent 



Lyap(A) = J log {f' x {i)) dx (Al) 



is negative, then the basin B L has strictly positive measure. In fact, for almost every x £ 
R/Z the basin B L intersects the "fiber" x x I in an interval of positive length. On the other 
hand, if Lyap(.A t ) > then B L has measure zero. 

Proof. First consider the case l = with Lyap(^lo) < 0. By Taylor's expansion restricted to 
the fiber over x, we have 

f x (y) = fMy + o(y 2 ), 

uniformly for all (x,y) 6 (R/Z) x /. Choose K > so that, 

Uy) < y(f' x (0)+Ky) for all (x,y) . 
For any rj > 0, it follows that 

Uy) < y{f' x (0)+v) whenever y < -| . (A2) 
Since Lyap(^lo) < 0, we can choose rj > small enough so that 

J log(f' x (0) + r])dx < 0. (A3) 



It will be convenient to introduce the notation 

a(x) = log (/^(0) +??). (A4) 

Consider some orbit (xq, yo) \— > (x\, yi) i—* (x2, 2/2) By the Birkhoff Ergodic Theorem, the 

averages 

-(a(x ) + a(xi) + . . . + a(xn-i) ) 
n\ / 
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converge to Jjg/g a(x) dx < for almost all xq. In particular, it follows that the n-fold sum 

A n (x ) = a(x ) + a{x\) + . . . + a(x n -i) 
converges to negative infinity as n — > oo. Hence the maximum 

A max (x) = maxA n (x) 

n>0 

is certainly defined and finite for almost all x. Now suppose that 

y < H e - A ^ x °). (A5) 
K 

Then a straightforward induction shows that 

„ < Jt f> A n (x )-A lnax (x ) < V 

Vn ~ K 6 ~ K 

for all n. Since A n (xo) converges to — oo, it follows that y n tends to zero, so that (xo, yo) belongs 
to the attracting basin Bo- Since the right side of the inequality (jA5|) is a measurable function of 
xo, defined and strictly positive almost everywhere, it follows that its integral is strictly positive. 
Evidently this integral is a lower bound for the area of Bo. Thus Bo has positive measure as required. 

The proof for the case Lyap(^4o) > is completely analogous. However, it requires us to make 
use of the hypothesis that f' x (y) is strictly positive, even for y = 0, so that we can choose a small 
r\ with < rj < /^.(O) everywhere, and with 

J log (£(0) - rj) dx >0. (A6) 

The estimate (jA2j) is then replaced by 

fx(y) > y{f'x(°) ~ V) whenever y < — . (A7) 

Now suppose that the basin Bq has positive measure. Then, for a set of xo of positive measure, 
we could find orbits (xo, yo) l— * Ui) which satisfied < y n < for all n. But using 

(|A6|) and (|A7|) it is not hard to see that this is impossible. Therefore Bo has measure zero. The 
arguments for the basin B\ are completely analogous. □ 



Appendix B: Schwarzian Derivative and Cross-Ratios. 

Recall that the Schwarzian derivative Sf of a C 3 interval diffeomorphism was defined 

in Equation (jHJ). The statement Sf < will mean that the inequality Sf(y) < holds for y in 
a dense open subset of /; and similarly for Sf > (or Sf = ). 

PROPOSITION B.l. The Schwarzian derivative has the following properties: 

1. The sign of Sf is preserved under iteration of f . For example if Sf < and Sg < , 
then S(fog) < . 

2. Sf < if and only if Sf- 1 > . 

3. Sf < if and only if the function (f(y) = l/-v/|/'(y)| is concave upwards. 

4. Sf = if and only if f is a fractional linear transformation, f{x) = (ax + b)/(cx + d) 
where we may assume that ad — be = ± 1 . 
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Proof. 

1. A straightforward calculation shows that the Schwarzian derivative of a composition is given 
by the formula 

S(fog) = (g') 2 Sf + Sg, (Bl) 
and the conclusion follows easily. 

2. This follows by taking g = / _1 in equation (|B1|) and noting that the identity map has 
Schwarzian zero. 

3. It is not hard to calculate that the second derivative of (p(y) = 1/ \/\f'(y)\ satisfies the 
equation 

Sf = -2<f/'(x)Mx). (B2) 

Thus Sf(y) < on a dense open set if and only if ip"(y) > on a dense open set, and the 
assertion follows. 

4. Prom equation (|B2|) we see that the Schwarzian is zero if and only if the function <p(y) is 
linear, say (p(y) = cy + d or in other words 

f'(y) = ±\/{cy + df. 

Integrating, we see that this is true if and only if / is fractional linear. □ 

Now consider a C 3 -diffeomorphism /:/—>/' where I' may be a different interval of real 
numbers. A fixed point y = f(y) Sin/' will be called strictly attracting if \f'{y)\ < 1 and strictly 
repelling if |/'(y)| > 1. 

LEMMA B.2. Suppose that Sf < throughout a dense open subset of I. If f has two 

fixed points, then at least one must be strictly attracting. Furthermore, if there are three 
fixed points then the middle one must be strictly repelling and the other two must be strictly 
attracting. Such a map can never have four fixed points. 

Proof. Given two fixed points a < (3 , we will first show that at least one of the two must be 
strictly attracting. In fact, it follows from the Mean Value Theorem that some point a < x < (3 
must satisfy f'(y) = 1. If both f'(a) > 1 and /'(/?) > 1, then the graph of /' would have to have 
have a local minimum somewhere in the open interval (a, (3). Hence the function <p(y) would have 
a local maximum. Since (f"(y) > on a dense open set, this clearly leads to a contradiction. 

Now consider three fixed points a < (3 < 7. Then The Mean Value Theorem yields points 
y G (a, (3) and y' G (^,7) with f'(y) = f'{y') = 1. Again, if f'{(3) < 1, then the graph of /' would 
have a local minimum, yielding a contradiction. Therefore (3 is strongly repelling, hence a and 7 
must be strongly attracting. Evidently this leaves no possibility for a fourth fixed point. □ 

LEMMA B.3. If Sf < for an orientation preserving diffeomorphism f : I —* I, then 
/'(0)/'(l) < 1. 
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Proof. By the previous lemma, at least one of the two boundary fixed points must be strictly 
attracting: either /'(0) < 1, or /'(l) < 1, or both. Now consider the auxiliary function 

g(y) = l-/(l-y). 

Evidently g also has negative Schwarzian, with g'(0) = /'(l) and f'(0) = g'(l). The composition 
fog also has negative Schwarzian; hence fog has derivative less than one at at least one of the 
two endpoints. But the derivative at either endpoint is equal to the product /'(0)/'(l). It follows 
that this product is less than one, as required. □ 

Definition. The cross-ratio of four distinct real numbers will mean the expression 

/ \ (2/2 - 2/0X3/3 - s/i ) , d m 

PKVO, 2/1,2/2, 3/3) = 7 77 r. B3 

(3/1 - 3/o) (3/3 - 3/2) 

(The reader should take care, since conflicting notations are often used.) Note that 
P(V0, 2/1, 3/2, 3/3) > 1 whenever 3/0 < 3/1 < 3/2 < 2/3 • 

Evidently the cross ratio remains invariant whenever we replace each by ayi + b with a / 0. A 
brief computation shows that it also remains invariant when we replace each yi by 1 jyi . Since every 
fractional linear transformation can be expressed as a composition of affine maps and inversions, it 
follows that the cross-ratio is invariant under fractional linear transformations. 

We will say that a monotone map / increases cross-ratios if 

p(/(3/o), /(3/i), /(3/2), /(3/3)) > p(yo, 3/1, 3/2, 3/3) whenever 3/0 < 2/1 < 3/2 < 2/3 • 



LEMMA B.4. (Allwright.) Again let f : I I' be a C 3 -dijfeomorphism. Then f 
increases cross-ratios if and only if Sf < throughout some dense open subset of I. 

Remark. In Lemmas I PI E P1 and I PI note that we can obtain a corresponding statement 
for the case Sf > simply by applying the given statement to the inverse map from I' to I. For 
example: / decreases cross-ratios if and only if Sf > on a dense open set. 

Proof of Lemma Ef~4l (Compare |Allwright, 1978] .) First suppose that Sf<0 on a 
dense open set. Given points 3/0 < 3/1 < 3/2 < 3/3 , after composing / with a fractional linear 
transformation, we may assume that / fixes the three points 3/0, 3/1, 3/3- If Sf < on a dense 
set, then yo, 3/3 are attracting and yi is repelling bv Lemma I PI Since there can be no fixed point 
between y\ and 2/3, it follows that / moves every intermediate point to the right. Thus 7(3/2) > 2/2 , 
and it follows easily that / increases the cross-ratio p(yo, 3/1, 3/2, 3/3)- 

Conversely, if Sf is not negative on a dense open set, then it must either be strictly positive 
somewhere, or identically zero on some interval. In the first case, it would decrease some cross- 
ratio, and in the second case it would be fractional-linear and hence preserve cross-ratios within 
this interval. This completes the proof. □ 
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Remark B.5. If < y\ < yi < 1, then the Poincare distance between y\ and ?/2 within (0, 1) can 
be defined as 



1/2 

d[o,i](yi,lte) = J {^ + JT^y)) dy 



yi 



= logf ^j) m \ ) = log p(0,yx, 2/ 2) l)- 
^yi(i - 2/2)/ 

This can be identified with the usual Poincare distance within a complex disk having the interval 
[0,1] as diameter. In terms of the Poincare arclength coordinate of Equation (|11|). the Poincare 
distance formula can also be written as, 

d[o,i](l/i,lte) = |t(!fe)-f(l/i)|- (B4) 
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